# Local deployment

Jan Nano 8bit
Apache-2.0
Jan-nano-8bit is an 8-bit quantized version converted from the Menlo/Jan-nano model, optimized for the MLX framework and suitable for text generation tasks.
Large Language Model
J
mlx-community
188
1
Minicpm4 8B Q8 0 GGUF
Apache-2.0
MiniCPM4-8B-Q8_0-GGUF is a model converted from openbmb/MiniCPM4-8B to GGUF format via llama.cpp, suitable for local inference.
Large Language Model Transformers Supports Multiple Languages
M
AyyYOO
160
2
Chinda Qwen3 4b Gguf
Apache-2.0
Chinda LLM 4B is a cutting-edge Thai model launched by iApp Technology, built on the Qwen3-4B architecture, bringing advanced thinking capabilities to the Thai AI ecosystem.
Large Language Model
C
iapp
115
1
Qwen3 235B A22B 4bit DWQ
Apache-2.0
Qwen3-235B-A22B-4bit-DWQ is a 4-bit quantized version converted from the Qwen3-235B-A22B-8bit model, suitable for text generation tasks.
Large Language Model
Q
mlx-community
70
1
Qwen3 8B 4bit AWQ
Apache-2.0
Qwen3-8B-4bit-AWQ is a 4-bit AWQ quantized version converted from Qwen/Qwen3-8B, suitable for text generation tasks in the MLX framework.
Large Language Model
Q
mlx-community
1,682
1
Qwen3 30B A3B GGUF
The GGUF quantized version of Qwen3-30B-A3B, supporting multi-bit quantization, suitable for text generation tasks.
Large Language Model
Q
MaziyarPanahi
158.92k
3
Qwen3 30B A3B 4bit
Apache-2.0
Qwen3-30B-A3B-4bit is a 4-bit quantized version converted from Qwen/Qwen3-30B-A3B, suitable for efficient text generation tasks under the MLX framework.
Large Language Model
Q
mlx-community
2,394
7
Qwen3 0.6B GGUF
GGUF quantized version of Qwen3-0.6B, suitable for text generation tasks.
Large Language Model
Q
MaziyarPanahi
233.95k
2
Qwen3 14B MLX 4bit
Apache-2.0
Qwen3-14B-4bit is a 4-bit quantized version of the Qwen/Qwen3-14B model converted using mlx-lm, suitable for text generation tasks.
Large Language Model
Q
lmstudio-community
3,178
4
Oute TTS 500M
Apache-2.0
OuteTTS is a text-to-speech (TTS) model focused on the Turkish language, based on a 500M parameter scale, capable of converting Turkish text into natural speech.
Speech Synthesis Other
O
Karayakar
27
0
3b Ko Ft Research Release Q4 K M GGUF
Apache-2.0
This is a 3B-parameter language model optimized for Korean, converted to GGUF format for compatibility with llama.cpp.
Large Language Model Korean
3
freddyaboulton
165
0
Mistral Small 3.1 24b Instruct 2503 Hf GGUF
This is a GGUF format quantized version of the mrfakename/mistral-small-3.1-24b-instruct-2503-hf model, suitable for text generation tasks.
Large Language Model
M
MaziyarPanahi
137.78k
2
Gemma 3 4b Pt Q4 0 GGUF
This is a GGUF format model converted from Google's Gemma 3.4B parameter model, suitable for text generation tasks.
Large Language Model
G
ngxson
74
1
Llama 3.1 8B RainbowLight EtherealMix GGUF
This is a quantized version in GGUF format based on the Llama-3.1-8B-RainbowLight-EtherealMix model, which facilitates the development of applications related to text generation.
Large Language Model
L
MaziyarPanahi
101
1
Qwq 32B GGUF
GGUF format quantized version of QwQ-32B, suitable for local text generation tasks.
Large Language Model
Q
MaziyarPanahi
459.38k
3
MMS TTS THAI FEMALEV1
This is a Thai female voice text-to-speech (TTS) model, fine-tuned based on the VITS architecture, supporting high-quality Thai speech synthesis.
Speech Synthesis Other
M
VIZINTZOR
81
2
Indri 0.1 124m Tts GGUF
Indri is a text-to-speech (TTS) model supporting English and Hindi, with a parameter size of 124M, optimized for CPU inference in GGUF format.
Speech Synthesis Supports Multiple Languages
I
11mlabs
86
0
Gte Qwen2 7B Instruct GGUF
Apache-2.0
A large language model developed by Alibaba NLP team, based on the Qwen2 architecture with 7B parameters, supporting instruction interaction
Large Language Model
G
tensorblock
1,502
11
Mlx Stable Diffusion 3 Medium
Other
MLX implementation of Stable Diffusion 3 Medium, focused on text-to-image generation
Image Generation English
M
argmaxinc
238
2
Smollm 135M 4bit
Apache-2.0
This is a 4-bit quantized 135M parameter small language model, suitable for text generation tasks in resource-constrained environments.
Large Language Model Transformers English
S
mlx-community
312
1
Deepseek V2 Lite Chat GGUF
Other
DeepSeek-V2-Lite-Chat is a lightweight chat model optimized based on the DeepSeek-V2 architecture, suitable for efficient dialogue generation tasks.
Large Language Model Transformers
D
gaianet
1,334
1
Gemma 2 27b It Q8 0 GGUF
This is a GGUF format model converted from Google's Gemma 2B model, suitable for text generation tasks.
Large Language Model
G
KimChen
471
2
Qwen2 7B Instruct GGUF
The GGUF quantized version of Qwen2-7B-Instruct, suitable for local deployment and inference
Large Language Model
Q
MaziyarPanahi
1.5M
11
Llama 2 7b Ukrainian Q8 0 GGUF
This is a Ukrainian and English language model based on the Llama-2-7b architecture, converted to GGUF format for use with the llama.cpp framework.
Large Language Model Supports Multiple Languages
L
NikolayKozloff
18
2
Meta Llama 3 8B Instruct Q4 K M GGUF
Other
The GGUF quantized version of the Llama 3 8B instruction model, suitable for local inference and supporting efficient deployment
Large Language Model English
M
NoelJacob
1,131
1
Gemma 7B Instruct Function Calling
CC
Gemma is a series of lightweight cutting-edge open-source large language models launched by Google, developed based on the Gemini technology framework, supporting English text generation tasks.
Large Language Model Transformers
G
InterSync
17
6
Tinyllama 1.1B Chat V1.0 GGUF
Apache-2.0
TinyLlama is a lightweight 1.1B-parameter Llama model optimized for chat and programming assistance tasks.
Large Language Model English
T
andrijdavid
117
2
Pandora 13B V1
Apache-2.0
Pandora-v1-13B is a 13B-parameter large language model that integrates multiple 7B models, using the passthrough fusion method to combine the best-performing 7B models from the OpenLLM leaderboard.
Large Language Model Transformers English
P
jan-ai
92
1
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase